Abstract

We consider random $n \times n$ matrices of the form

\[ (XX^* + YY^*)^{-1/2} \, YY^* \, (XX^* + YY^*)^{-1/2}, \]

where $X$ and $Y$ have independent entries with zero mean and variance one. These matrices
are the natural generalization of the Gaussian case, in which they are known as MANOVA matrices
and have joint eigenvalue density given by the classical Jacobi ensemble. We show that,
away from the spectral edge, the eigenvalue density converges to the limiting density of the
Jacobi ensemble even on the shortest possible scales of order $1/n$ (up to $\log n$ factors). The
result is the analogue of the Wigner semicircle law and the Marchenko-Pastur law for general
MANOVA matrices.
1 Introduction



The three classical families of eigenvalue distributions of Gaussian random matrices are the
Hermite, Laguerre and Jacobi ensembles. Hermite ensembles correspond to Wigner matrices,
$X = X^*$; Laguerre ensembles describe sample covariance matrices, $XX^*$. The random $n \times n$
matrices yielding the Jacobi ensembles have the form

\[ (XX^* + YY^*)^{-1/2} \, YY^* \, (XX^* + YY^*)^{-1/2}, \tag{1} \]

where $X$ and $Y$ are $n \times [bn]$ and $n \times [an]$ matrices with independent standard Gaussian entries. Here
$a, b > 1$ are fixed parameters of the model, $n$ is a large number, eventually tending to infinity, and
$[\,\cdot\,]$ denotes the integer part. The matrix entries can be real, complex or self-dual quaternions,
corresponding to the three symmetry classes, commonly distinguished by the parameter $\beta = 1, 2, 4$,
respectively. The results in this paper are insensitive to the symmetry class and for simplicity we
will consider the complex case ($\beta = 2$).

Matrices of the form (1) are used in statistics for multivariate analysis of variance to determine
correlation coefficients (Section 3.3 of [13]). This analysis is called MANOVA, though it has been
largely limited to the special case when the entries of (1) are Gaussian.

In this paper we address the case when the entries of $X$ and $Y$ in (1) are independent but have
general distribution with zero mean and unit variance. In particular, the matrix entries are not
required to be identically distributed. We will call such a matrix with general entries a general
MANOVA matrix.

As for Wigner and sample covariance matrices, the joint eigenvalue density of (1) is
explicitly known only for the Gaussian case. When the entries are standard complex Gaussians, it
is given by

\[ \mathrm{density}(\lambda_1, \ldots, \lambda_n) = C_{a,b,n} \prod_{j=1}^{n} \lambda_j^{(a-1)n} (1 - \lambda_j)^{(b-1)n} \prod_{1 \le j < k \le n} |\lambda_j - \lambda_k|^2, \qquad \lambda_j \in [0, 1], \tag{2} \]

where $C_{a,b,n}$ is a normalizing constant. The density has a similar form with different exponents
when the matrix entries are real or self-dual quaternions; see Section 3.6 of [11]. Equation (2)
defines the Jacobi ensemble, where the name refers to the form of the polynomial term in front of
the Vandermonde determinant in (2).

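The empirical density of the eigenvalues of (1) - equivalently, the one-point correlation
function of (2) - converges almost surely, as $n \to \infty$, to the distribution with density given by

\[ f_M(x) = (a + b) \, \frac{\sqrt{(x - \lambda_-)(\lambda_+ - x)}}{2\pi x (1 - x)} \, I_{[\lambda_-, \lambda_+]}(x), \tag{3} \]

where $I_{[\lambda_-, \lambda_+]}$ is the characteristic function of the interval $[\lambda_-, \lambda_+]$ and

\[ \lambda_\pm = \left( \sqrt{\frac{a}{a+b} \left( 1 - \frac{1}{a+b} \right)} \pm \sqrt{\frac{1}{a+b} \left( 1 - \frac{a}{a+b} \right)} \right)^2 \tag{4} \]

(see Section 3.6 of [11]). Note that $\lambda_\pm \in (0, 1)$, so that $f_M$ is supported on a compact subinterval
of $(0, 1)$. We will refer to $f_M(x)$ as the limiting distribution of the eigenvalues of (1) or as the
MANOVA distribution.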
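As a quick numerical sanity check of (3) and (4), the following sketch evaluates the edges and the density and integrates the density over its support. The parameter values $a = b = 2$ and the helper names `manova_edges`, `manova_density` and `trapezoid` are illustrative assumptions, not part of the paper.

```python
import numpy as np

def manova_edges(a, b):
    """Spectral edges (lambda_-, lambda_+) as in equation (4)."""
    s = a + b
    r1 = np.sqrt(a / s * (1.0 - 1.0 / s))
    r2 = np.sqrt(1.0 / s * (1.0 - a / s))
    return (r1 - r2) ** 2, (r1 + r2) ** 2

def manova_density(x, a, b):
    """Limiting density f_M of equation (3); zero outside the edges."""
    lam_minus, lam_plus = manova_edges(a, b)
    x = np.atleast_1d(np.asarray(x, dtype=float))
    out = np.zeros_like(x)
    inside = (x > lam_minus) & (x < lam_plus)
    xi = x[inside]
    out[inside] = ((a + b) * np.sqrt((xi - lam_minus) * (lam_plus - xi))
                   / (2.0 * np.pi * xi * (1.0 - xi)))
    return out

def trapezoid(y, x):
    """Plain trapezoidal rule on a grid."""
    return float(np.sum(0.5 * (y[1:] + y[:-1]) * np.diff(x)))

a, b = 2.0, 2.0  # illustrative values only
lam_minus, lam_plus = manova_edges(a, b)
grid = np.linspace(lam_minus, lam_plus, 200_001)
density = manova_density(grid, a, b)
total_mass = trapezoid(density, grid)   # should be close to 1
mean = trapezoid(grid * density, grid)  # should be close to a / (a + b)
print(lam_minus, lam_plus, total_mass, mean)
```

The first moment $a/(a+b)$ reflects that $Y$ contributes $an$ of the $(a+b)n$ columns of the block matrix $[X \; Y]$.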

While the joint eigenvalue density (2) is valid only for the Gaussian case, the limiting empirical
density is expected to be correct for general distributions as well, similarly to the universality of
the Wigner semicircle law for Wigner matrices or the Marchenko-Pastur (MP) law for sample
covariance matrices. Thus, general MANOVA matrices, the Jacobi ensemble and the distribution $f_M$
constitute a triplet analogous to general Wigner matrices, the Hermite ensemble and the semicircle
law, or to sample covariance matrices, the Laguerre ensemble and the Marchenko-Pastur law.
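The expected universality can be illustrated by a small simulation. The sketch below is only a heuristic illustration under stated assumptions: it uses real Rademacher ($\pm 1$) entries rather than the complex case treated in the paper, the values $n = 300$, $a = b = 2$ and the window $[0.3, 0.7]$ are arbitrary, and all function names are hypothetical.

```python
import numpy as np

def manova_edges(a, b):
    s = a + b
    r1 = np.sqrt(a / s * (1.0 - 1.0 / s))
    r2 = np.sqrt(1.0 / s * (1.0 - a / s))
    return (r1 - r2) ** 2, (r1 + r2) ** 2

def manova_density(x, a, b):
    lam_minus, lam_plus = manova_edges(a, b)
    x = np.atleast_1d(np.asarray(x, dtype=float))
    out = np.zeros_like(x)
    inside = (x > lam_minus) & (x < lam_plus)
    xi = x[inside]
    out[inside] = ((a + b) * np.sqrt((xi - lam_minus) * (lam_plus - xi))
                   / (2.0 * np.pi * xi * (1.0 - xi)))
    return out

def general_manova_eigenvalues(n, a, b, rng):
    """Eigenvalues of (XX^T + YY^T)^{-1/2} YY^T (XX^T + YY^T)^{-1/2}
    for X of size n x bn and Y of size n x an with +-1 entries."""
    X = rng.choice([-1.0, 1.0], size=(n, int(b * n)))
    Y = rng.choice([-1.0, 1.0], size=(n, int(a * n)))
    YY = Y @ Y.T
    w, V = np.linalg.eigh(X @ X.T + YY)
    inv_sqrt = (V / np.sqrt(w)) @ V.T      # inverse square root via eigendecomposition
    M = inv_sqrt @ YY @ inv_sqrt
    return np.linalg.eigvalsh(0.5 * (M + M.T))

rng = np.random.default_rng(0)
a, b, n = 2.0, 2.0, 300
eigs = general_manova_eigenvalues(n, a, b, rng)

# Compare the fraction of eigenvalues in a bulk window with the mass of f_M there.
lo, hi = 0.3, 0.7
empirical_fraction = float(np.mean((eigs >= lo) & (eigs <= hi)))
grid = np.linspace(lo, hi, 20_001)
vals = manova_density(grid, a, b)
predicted_fraction = float(np.sum(0.5 * (vals[1:] + vals[:-1]) * np.diff(grid)))
print(empirical_fraction, predicted_fraction)
```

Agreement on a macroscopic window like this one only illustrates the global law; the local statements proved in this paper concern much shorter windows of length slightly above $1/n$.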

Universality results have been intensely pursued for the latter two types of matrices, starting
from the fundamental work of Wigner [20] and Marchenko-Pastur [12], who identified the
corresponding distributions. These first results were on the macroscopic scale; the empirical density
on spectral scales containing $O(n)$ eigenvalues was shown to converge in a weak sense to the
limiting law. Recently local versions of these fundamental laws have also been established on the
shortest possible scale, containing $O(\varphi(n))$ eigenvalues, where $\varphi(n)$ is a factor logarithmic in $n$.
For Wigner matrices it was achieved first in the bulk [4], then optimally up to the edges [9].
For sample covariance matrices the optimal scale in the bulk was reached in [7], followed by the
optimal result up to the edge in [15]. Related results were also obtained in [18, 19, 14].

In this paper we prove the local convergence of the density on the optimal scale for the general 
MANOVA ensembles in the bulk spectrum. This establishes the analogue of the results [|51 13 
for these ensembles. We remark that the convergence even on the largest scale, i.e. the analogue 
of lEOimi . has not been known before. 

Precise results on the local density have opened up the route to establish the full universality
of local eigenvalue statistics for Wigner and sample covariance matrices, including precise
identification of the statistics of consecutive gaps. A new general method based on the Dyson Brownian
motion (DBM) was first introduced in [6]. It is applicable to all symmetry classes [7], to very
general distributions [9] and to sample covariance matrices [15]. The local semicircle law (or the local
MP law) is a basic input in all these works. Local density results have also inspired an alternative
route to universality [19, 18] that is applicable for the complex case, $\beta = 2$.

In light of these developments for the Wigner and sample covariance matrices, the current work 
is the first step towards establishing the full universality of eigenvalue statistics for the general 
MANOVA ensemble. Optimal error bounds for the density and adaptation of the DBM method to 
this ensemble are in preparation. 



2 Statement of the Main Result 

Given two positive constants $\gamma = (\gamma_1, \gamma_2)$, we say that a complex random variable $Z$ is
$\gamma$-subexponential if it satisfies the following conditions:

\[ \mathbb{E} Z = 0, \qquad \mathbb{E} |Z|^2 = 1, \qquad \mathbb{P}\left( |Z| \ge t^{\gamma_1} \right) \le \gamma_2 e^{-t} \quad \text{for all } t > 0. \tag{5} \]

A set of random variables is uniformly $\gamma$-subexponential if each random variable is
$\gamma$-subexponential for a common $\gamma$. Assuming that the matrix elements of $X$ and $Y$ are independent,
uniformly $\gamma$-subexponential random variables, we will prove that the empirical distribution of the
eigenvalues of (1) in the bulk converges on small scales to (3) as $n \to \infty$.

The main tool for this approach is the Stieltjes transform. The Stieltjes transform of a real
random variable with distribution function $F$ is a function $\mathbb{C}^+ \to \mathbb{C}^+$ defined by

\[ m(z) = \int_{\mathbb{R}} \frac{1}{t - z} \, dF(t). \tag{6} \]

If the random variable has a density, then we also refer to the Stieltjes transform of the density.
The Stieltjes transform of $f_M$ is

\[ m_M(z) = \frac{(2 - a - b) z + a - 1 + \sqrt{(a + b)^2 z^2 - (a + b)\left( 2(a + 1) - \frac{4a}{a + b} \right) z + (a - 1)^2}}{2 z (1 - z)}. \tag{7} \]
This formula is derived in Appendix A.
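The closed form (7) can be compared with a direct numerical evaluation of (6) for the density (3). The sketch below is an illustration under assumptions: the values $a = b = 2$ and the test points are arbitrary, the square-root branch is selected by requiring $\Im m > 0$, and the function names are hypothetical.

```python
import numpy as np

def manova_edges(a, b):
    s = a + b
    r1 = np.sqrt(a / s * (1.0 - 1.0 / s))
    r2 = np.sqrt(1.0 / s * (1.0 - a / s))
    return (r1 - r2) ** 2, (r1 + r2) ** 2

def m_manova_formula(z, a, b):
    """Closed form (7); of the two square-root branches keep the one with Im m > 0."""
    s = a + b
    lin = (2.0 - s) * z + a - 1.0
    disc = s ** 2 * z ** 2 - s * (2.0 * (a + 1.0) - 4.0 * a / s) * z + (a - 1.0) ** 2
    root = np.sqrt(complex(disc))
    denom = 2.0 * z * (1.0 - z)
    candidates = ((lin + root) / denom, (lin - root) / denom)
    return max(candidates, key=lambda m: m.imag)

def m_manova_quadrature(z, a, b, num=400_001):
    """Trapezoidal approximation of the integral of f_M(t) / (t - z)."""
    lam_minus, lam_plus = manova_edges(a, b)
    t = np.linspace(lam_minus, lam_plus, num)
    dens = np.zeros_like(t)
    ti = t[1:-1]
    dens[1:-1] = ((a + b) * np.sqrt((ti - lam_minus) * (lam_plus - ti))
                  / (2.0 * np.pi * ti * (1.0 - ti)))
    integrand = dens / (t - z)
    return complex(np.sum(0.5 * (integrand[1:] + integrand[:-1]) * np.diff(t)))

a, b = 2.0, 2.0  # illustrative values only
points = [0.5 + 0.1j, 0.3 + 0.05j]
errors = [abs(m_manova_formula(z, a, b) - m_manova_quadrature(z, a, b)) for z in points]
print(errors)
```

Equivalently, (7) is the root in the upper half-plane of the quadratic $z(1-z)m^2 - ((2-a-b)z + a - 1)m + (a+b-1) = 0$, which follows by squaring out the discriminant.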

For self-adjoint matrices, we abuse notation and refer to the function

\[ m_A(z) = \frac{1}{n} \operatorname{tr} (A - zI)^{-1} \]

as the Stieltjes transform of the self-adjoint $n \times n$ matrix $A$. If $\lambda_1, \ldots, \lambda_n$ are the eigenvalues of
$A$, then we equivalently have

\[ m_A(z) = \frac{1}{n} \sum_{k=1}^{n} \frac{1}{\lambda_k - z}, \]

which is the Stieltjes transform of the empirical measure. 
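The equivalence of the two expressions for $m_A$ is easy to check numerically. A minimal sketch, assuming an arbitrary $50 \times 50$ Hermitian test matrix and hypothetical function names:

```python
import numpy as np

def stieltjes_via_resolvent(A, z):
    """(1/n) tr (A - zI)^{-1}."""
    n = A.shape[0]
    return np.trace(np.linalg.inv(A - z * np.eye(n))) / n

def stieltjes_via_eigenvalues(A, z):
    """(1/n) sum_k 1 / (lambda_k - z)."""
    eigenvalues = np.linalg.eigvalsh(A)
    return np.mean(1.0 / (eigenvalues - z))

rng = np.random.default_rng(1)
G = rng.standard_normal((50, 50)) + 1j * rng.standard_normal((50, 50))
A = (G + G.conj().T) / 2.0   # Hermitian test matrix
z = 0.3 + 0.05j
m_res = stieltjes_via_resolvent(A, z)
m_eig = stieltjes_via_eigenvalues(A, z)
print(m_res, m_eig)
```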

Our main result shows that the eigenvalues of the general MANOVA matrix behave close to
what is indicated by $m_M(z)$ and $f_M$ in the bulk with high probability. To state the result, we must
formalize the term bulk. Following this definition we state the main theorem, Theorem 2.2.

Definition 2.1. Let $\lambda_+$ and $\lambda_-$ be as given in (4). Define

\[ \mathcal{S}_{\kappa, \eta} := \{ E + i\eta \in \mathbb{C}^+ : E \in (\lambda_-, \lambda_+) \text{ and } (\lambda_+ - E)(E - \lambda_-) \ge \kappa \} \]

and set $\mathcal{S}_\kappa := \mathcal{S}_{\kappa, 0}$.

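Theorem 2.2. Fix two real parameters $a, b > 1$. Let $X$ be an $n \times bn$ random matrix and let $Y$
be an $n \times an$ random matrix independent of $X$. We assume that both matrices have independent
entries satisfying (5) for a common $\gamma = (\gamma_1, \gamma_2)$. Let $m_{n,M}(z)$ be the Stieltjes transform of the
general MANOVA matrix

\[ (XX^* + YY^*)^{-1/2} \, YY^* \, (XX^* + YY^*)^{-1/2}. \tag{8} \]

i) Then for any $\kappa, \eta > 0$ with $\eta > \frac{1}{n} (\log n)^{2C \log\log n}$, we have

\[ \mathbb{P}\left( \sup_{z \in \mathcal{S}_{\kappa, \eta}} |m_{n,M}(z) - m_M(z)| > \frac{(\log n)^{C \log\log n}}{\sqrt{n \eta \kappa}} \right) \le n^{-c \log\log n} \tag{9} \]

for all $n \ge n_0$ large enough and for constants $C, c > 0$. Here $n_0$, $C$ and $c$ depend only on $\gamma$.

ii) Let $\mathcal{N}_\eta(E)$ denote the number of eigenvalues of (8) contained in $[E - \frac{\eta}{2}, E + \frac{\eta}{2}]$ and assume
$\eta > \frac{1}{n} (\log n)^{3C \log\log n}$. Then

\[ \mathbb{P}\left( \sup_{E : (\lambda_+ - E)(E - \lambda_-) \ge \kappa} \left| \frac{\mathcal{N}_\eta(E)}{n \eta} - f_M(E) \right| > \frac{(\log n)^{C \log\log n}}{(n \eta)^{1/4}} \right) \le n^{-c \log\log n}. \tag{10} \]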



We note that the entries of the matrices X and Y are not necessarily identically distributed. 

Theorem 2.2 shows that the Stieltjes transform of the general MANOVA matrices is close
to $m_M(z)$ when the real part of $z$, $E = \Re z$, is away from the spectral edge and the imaginary
part $\eta = \Im z$ is small. In particular, $\eta$ may be as small as the shortest possible scale $1/n$, up to
logarithmic corrections. The second part of Theorem 2.2 is an easy consequence of the first one
and it asserts that the local density on scale $1/n$ (modulo logarithmic corrections) is given by the
MANOVA density $f_M(E)$. While our analysis is valid down to the optimal scale $\eta \gg 1/n$, the error
bound of the form $(n\eta)^{-1/2}$ is not optimal. The best estimate should scale like $(n\eta)^{-1}$ and should
not blow up near the edge, similarly to the best estimate in the Wigner case [9] and in the sample
covariance case [15]. Work to obtain the optimal error bounds is in preparation.
2.1 General Approach

We start with the observation that the eigenvalues of (1) are the same as those of

\[ (YY^*)^{1/2} (XX^* + YY^*)^{-1} (YY^*)^{1/2} = \left( I + (YY^*)^{-1/2} XX^* (YY^*)^{-1/2} \right)^{-1}. \]

We begin our work with the matrix $(YY^*)^{-1/2} XX^* (YY^*)^{-1/2}$, which we will call the product matrix
and for which we can use the approach developed in [[31 SI [Vj [T3]| and related papers. This approach 
determines an implicit equation for the Stieltjes transform of the limiting empirical eigenvalue 
distribution and shows that the solution is stable. Then it is shown that with high probability 
the Stieltjes transform of the empirical distribution nearly satisfies this implicit equation. From 
stability we conclude how close the empirical distribution is to its limit. 

After obtaining results for the matrices $(YY^*)^{-1/2} XX^* (YY^*)^{-1/2}$, we return to the matrices of our
original interest. Note that if $\mu$ is an eigenvalue of $(YY^*)^{-1/2} XX^* (YY^*)^{-1/2}$, then $\lambda = (1 + \mu)^{-1}$
is an eigenvalue of (1). The eigenvalues of $(YY^*)^{-1/2} XX^* (YY^*)^{-1/2}$ are non-negative so that the
map just given is regular. This allows us to show that if the eigenvalues of $(YY^*)^{-1/2} XX^* (YY^*)^{-1/2}$
are distributed close to their limiting distribution, then the eigenvalues of (1) are also close to their
limiting distribution. 
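The correspondence $\lambda = (1 + \mu)^{-1}$ between the two spectra is exact for every realization, which can be verified numerically. A minimal sketch, assuming small complex Gaussian matrices ($n = 20$, $a = b = 2$) and hypothetical helper names:

```python
import numpy as np

def inverse_sqrt(S):
    """Inverse square root of a Hermitian positive definite matrix."""
    w, V = np.linalg.eigh(S)
    return (V / np.sqrt(w)) @ V.conj().T

def complex_gaussian(rng, shape):
    """Matrix with independent standard complex Gaussian entries."""
    return (rng.standard_normal(shape) + 1j * rng.standard_normal(shape)) / np.sqrt(2.0)

rng = np.random.default_rng(2)
n, a, b = 20, 2, 2
X = complex_gaussian(rng, (n, b * n))
Y = complex_gaussian(rng, (n, a * n))
XX, YY = X @ X.conj().T, Y @ Y.conj().T

S_inv_half = inverse_sqrt(XX + YY)
manova = S_inv_half @ YY @ S_inv_half      # matrix (1)
Y_inv_half = inverse_sqrt(YY)
product = Y_inv_half @ XX @ Y_inv_half     # product matrix

lam = np.linalg.eigvalsh((manova + manova.conj().T) / 2.0)
mu = np.linalg.eigvalsh((product + product.conj().T) / 2.0)
mapped = np.sort(1.0 / (1.0 + mu))         # the map reverses the ordering
print(np.max(np.abs(lam - mapped)))
```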

2.2 Conventions 

We make the following conventions, which will be used without referring to them. The letters $C$
and $c$ will denote positive absolute constants that may change from appearance to appearance. We
use the complex number $z = E + i\eta$ for the spectral parameter, where $\eta$ will always be positive.
The edges of the limiting spectrum $\lambda_-$ and $\lambda_+$ are given in (4) and we set

\[ \mu_- = \frac{1}{\lambda_+} - 1 \quad \text{and} \quad \mu_+ = \frac{1}{\lambda_-} - 1 \tag{11} \]

to be the limiting spectral edges of the product matrix.

Rather than writing $[an]$ and $[bn]$ we will write $an$ and $bn$, and it is implicit that we are using
the integer part. Most quantities in this paper depend on the parameters $a$ and $b$, but we usually
omit this fact in the notation.

3 A Product Matrix 

We begin by addressing the eigenvalues of the product matrix $(YY^*)^{-1/2} XX^* (YY^*)^{-1/2}$. Denote
by $f_p$ the limiting empirical spectral distribution for matrices of the form $(YY^*)^{-1/2} XX^* (YY^*)^{-1/2}$
with Gaussian entries. This distribution, which will be derived below, is explicitly given by

\[ f_{p,a,b}(x) = f_p(x) = \frac{a + b}{2\pi x} \sqrt{\left( \frac{1}{1+x} - \lambda_- \right) \left( \lambda_+ - \frac{1}{1+x} \right)} \; I_{[\mu_-, \mu_+]}(x), \tag{12} \]

where $I_{[u,v]}$ is the characteristic function of the interval $[u, v]$. We denote this distribution's Stieltjes
transform $m_p$, and we will use the subscript $p$ for functions associated with the product matrix. The
main theorem of this section, Theorem 3.2, relates the eigenvalues of $(YY^*)^{-1/2} XX^* (YY^*)^{-1/2}$ to
the functions $f_p$ and $m_p$. It is followed by a delocalization result, Theorem 3.3. Our main result,
Theorem 2.2, will also follow from Theorem 3.2.

The following definition formalizes the bulk for the product matrices; it is the analogous region
to $\mathcal{S}_{\kappa, \eta}$.

Definition 3.1. For $\lambda_\pm$ and $\mu_\pm$ as defined in (4) and (11) and for $\kappa > 0$, define

\[ \mathcal{S}^p_{\kappa, \eta} := \left\{ E + i\eta \in \mathbb{C}^+ : E \in (\mu_-, \mu_+) \text{ and } \left( \lambda_+ - \frac{1}{1+E} \right) \left( \frac{1}{1+E} - \lambda_- \right) \ge \kappa \right\} \]

and set $\mathcal{S}^p_\kappa := \mathcal{S}^p_{\kappa, 0}$.

The following is the main result on product matrices. 

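Theorem 3.2. Let $X$ be an $n \times bn$, $b > 1$, random matrix with independent entries satisfying
(5) for a uniform $\gamma$. Let $Y$ be an $n \times an$, $a > 1$, random matrix independent of $X$ with
independent entries also satisfying (5) for the same $\gamma$. Let $m_{n,p}(z)$ be the Stieltjes transform of
$(YY^*)^{-1/2} XX^* (YY^*)^{-1/2}$. Fix $\kappa > 0$ to be a small positive constant.

i) Then for $\eta > \frac{1}{n} (\log n)^{2C \log\log n}$

\[ \mathbb{P}\left( \sup_{z \in \mathcal{S}^p_{\kappa, \eta}} |m_{n,p}(z) - m_p(z)| > \frac{(\log n)^{C \log\log n}}{\sqrt{n \eta \kappa}} \right) \le n^{-c \log\log n} \tag{13} \]

for all $n$ large enough and for constants $C, c > 0$ depending only on $\gamma$.

ii) Let $\mathcal{N}_\eta(E)$ denote the number of eigenvalues of $(YY^*)^{-1/2} XX^* (YY^*)^{-1/2}$ in $[E - \frac{\eta}{2}, E + \frac{\eta}{2}]$,
and assume $\eta > \frac{1}{n} (\log n)^{3C \log\log n}$. Then

\[ \mathbb{P}\left( \sup_{E} \left| \frac{\mathcal{N}_\eta(E)}{n \eta} - f_p(E) \right| > \frac{(\log n)^{C \log\log n}}{(n \eta)^{1/4}} \right) \le n^{-c \log\log n}, \]

where the supremum is over $E \in (\mu_-, \mu_+)$ with $\left( \lambda_+ - \frac{1}{1+E} \right) \left( \frac{1}{1+E} - \lambda_- \right) \ge \kappa$.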



Theorem 3.3. Set $A := (YY^*)^{-1/2} XX^* (YY^*)^{-1/2}$ and assume that $X$, $Y$ satisfy the same
conditions as in Theorem 3.2. Then

3 v e C n , \\v\\ 2 = l,Av = fiv, /i e and (MU > y S ^_ J < n - cloglogn . 

To prove Theorem 3.2, we first determine the implicit equation for $m_p$. We recall that the
corresponding implicit equation for both the Wigner semicircle law and the Marchenko-Pastur law
is a simple algebraic equation. In the case of the product matrix, it turns out that $m_p$ satisfies an
implicit integral equation with good stability properties. In Section 3.1 we show that the Stieltjes
transform of the empirical density, $m_{n,p}$, approximately satisfies the implicit equation for $m_p$ and
we identify the error term. The error will be controlled in Section 3.4 after several preliminary
lemmas in Section 3.2.
3.1 Stieltjes Transform of the Product Matrix

The first step of the proof is determining an implicit equation for the Stieltjes transform of the
product matrix. While determining this equation, we initially view $(YY^*)^{-1}$ as a fixed matrix and,
adjusting the scaling, set $T := \left( \frac{1}{n} YY^* \right)^{-1}$.

Random covariance matrices of the form $\frac{1}{an} YY^*$ have limiting distribution

\[ f_{MP,a}(x) = f_{MP}(x) = \frac{a}{2\pi x} \sqrt{\left( x - (1 - a^{-1/2})^2 \right) \left( (1 + a^{-1/2})^2 - x \right)}, \qquad (1 - a^{-1/2})^2 \le x \le (1 + a^{-1/2})^2, \tag{15} \]

which is called the Marchenko-Pastur distribution [12]. The local Marchenko-Pastur law was
obtained in [7], and the optimal result for the edge was obtained in [15]. The random matrix $T$
then has limiting distribution



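\[ f_{Inv,a}(x) = f_{Inv}(x) = \frac{1}{a x^2} f_{MP}\left( \frac{1}{ax} \right) = \frac{a}{2\pi x} \sqrt{\left( \frac{1}{ax} - (1 - a^{-1/2})^2 \right) \left( (1 + a^{-1/2})^2 - \frac{1}{ax} \right)}. \tag{16} \]

Note that $f_{Inv}$ is supported on $[a_-, a_+]$, where $a_\pm = \left( a (1 \mp a^{-1/2})^2 \right)^{-1}$.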



Here we also address the distribution $f_p$. When the entries of $X$ and $Y$ are Gaussian, the
limiting eigenvalue distribution of $(YY^*)^{-1/2} XX^* (YY^*)^{-1/2}$ can be obtained from the MANOVA
distribution $f_M$ using the transformation described in Section 2.1. It is

\[ f_p(x) = \frac{1}{(1 + x)^2} \, f_M\left( \frac{1}{1 + x} \right). \tag{17} \]
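For the reader's convenience, the change of variables $\lambda = (1 + x)^{-1}$ from Section 2.1 can be carried out explicitly. The following worked computation is a sketch that starts from the density $f_M$ in (3) and recovers the explicit form of $f_p$ together with its support:

```latex
\begin{aligned}
f_p(x) &= f_M\!\left(\frac{1}{1+x}\right)\left|\frac{d\lambda}{dx}\right|
        = \frac{1}{(1+x)^2}\, f_M\!\left(\frac{1}{1+x}\right) \\
       &= \frac{1}{(1+x)^2}\cdot
          \frac{(a+b)\sqrt{\left(\frac{1}{1+x}-\lambda_-\right)\left(\lambda_+-\frac{1}{1+x}\right)}}
               {2\pi\,\frac{1}{1+x}\cdot\frac{x}{1+x}} \\
       &= \frac{a+b}{2\pi x}\sqrt{\left(\frac{1}{1+x}-\lambda_-\right)\left(\lambda_+-\frac{1}{1+x}\right)},
\end{aligned}
\qquad
\lambda\in[\lambda_-,\lambda_+]
\iff
x\in\left[\tfrac{1}{\lambda_+}-1,\ \tfrac{1}{\lambda_-}-1\right].
```

Here $1 - \lambda = x/(1+x)$ was used in the denominator, and the endpoints of the support are the edges $\mu_\mp$ of the product matrix.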



We use $m_p(z)$ to denote the Stieltjes transform of $f_p$. We remark that no explicit formula is
available for $m_p$, unlike the case of the semicircle or the Marchenko-Pastur law, but it satisfies an integral
equation. By Lemma 5.1 of IfTTTl the function $m_p$ is the unique solution taking values in $\mathbb{C}^+$ to the
implicit equation

\[ m(z) = \int \frac{1}{\lambda \left( 1 - \frac{1}{a} - \frac{1}{a} z m(z) \right) - z} \, f_{Inv}(\lambda) \, d\lambda. \tag{18} \]

For the remainder of Section 3 we will shorten the notation $m_{n,p}$ to $m_n$. Thus, $m_n(z)$ denotes
the Stieltjes transform of the product matrix $(YY^*)^{-1/2} XX^* (YY^*)^{-1/2} = \frac{1}{n} T^{1/2} XX^* T^{1/2}$. The goal
is to derive a self-consistent equation for $m_n(z)$ that is close to (18). We will then establish the
stability of (18) and conclude that $m_n(z)$ is close to $m_p(z)$.

The derivation of the equation for $m_n(z)$ follows Silverstein and Bai in [17, 16]. Here we set
$N = bn$. Let $X$ be an $n \times N$ matrix, and $T$ an $n \times n$ positive definite matrix. Then $m_n$ is the
Stieltjes transform of $\frac{1}{n} T^{1/2} XX^* T^{1/2}$ and we let $\underline{m}_n$ be the Stieltjes transform of $\frac{1}{n} X^* T X$. Since the
nonzero spectra of $AA^*$ and $A^*A$ coincide for any matrix $A$, we easily get that

\[ \underline{m}_n(z) = -\frac{1}{z} \left( 1 - \frac{1}{a} \right) + \frac{1}{a} m_n(z). \tag{19} \]
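The mechanism behind (19) is that $A^*A$ has the same nonzero eigenvalues as $AA^*$ plus extra zero eigenvalues. The sketch below checks the resulting linear relation for a generic $n \times N$ matrix with $n < N$; the sizes and the real Gaussian entries are illustrative assumptions, and the aspect ratio $n/N$ stands in for the constant multiplying $m_n$ in (19).

```python
import numpy as np

rng = np.random.default_rng(3)
n, N = 30, 75                      # illustrative sizes with n < N
A = rng.standard_normal((n, N)) / np.sqrt(N)
z = 0.8 + 0.2j

# Stieltjes transforms of the n x n matrix AA^T and the N x N matrix A^T A.
m_small = np.mean(1.0 / (np.linalg.eigvalsh(A @ A.T) - z))
m_big = np.mean(1.0 / (np.linalg.eigvalsh(A.T @ A) - z))

# A^T A carries N - n additional zero eigenvalues, each contributing -1/z.
ratio = n / N
m_big_predicted = -(1.0 - ratio) / z + ratio * m_small
print(abs(m_big - m_big_predicted))
```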
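We let $x_j$ denote the $j$th column of $X$. Set

\[ r_j := \frac{1}{\sqrt{n}} T^{1/2} x_j, \qquad B_n := \sum_{j=1}^{N} r_j r_j^*, \qquad B_{(j)} := \sum_{k=1, k \ne j}^{N} r_k r_k^*. \]

We let $m_{(j)}(z)$ be the Stieltjes transform of $B_{(j)}$ and set

\[ \underline{m}_{(j)}(z) = -\frac{1}{z} \left( 1 - \frac{1}{a} \right) + \frac{1}{a} m_{(j)}(z). \tag{20} \]

We will use the following simple formula: if $A$ is an $n \times n$ matrix, $q \in \mathbb{C}^n$ and both $A$ and $A + qq^*$
are invertible, then

\[ q^* (A + qq^*)^{-1} = \frac{1}{1 + q^* A^{-1} q} \, q^* A^{-1}. \tag{21} \]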
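Formula (21) is a consequence of the Sherman-Morrison rank-one update formula. A minimal numerical check, assuming an arbitrary positive definite $6 \times 6$ test matrix (the names and sizes are illustrative):

```python
import numpy as np

rng = np.random.default_rng(4)
n = 6
G = rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))
A = G @ G.conj().T + np.eye(n)            # positive definite, hence invertible
q = rng.standard_normal(n) + 1j * rng.standard_normal(n)
q_star = q.conj()                         # q^* as a row vector

# Left side: q^* (A + q q^*)^{-1}
lhs = q_star @ np.linalg.inv(A + np.outer(q, q_star))

# Right side: q^* A^{-1} / (1 + q^* A^{-1} q)
A_inv = np.linalg.inv(A)
rhs = (q_star @ A_inv) / (1.0 + q_star @ A_inv @ q)
print(np.max(np.abs(lhs - rhs)))
```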

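Using the definition of $B_n$ we obtain the identity

\[ I + z (B_n - zI)^{-1} = \sum_{j=1}^{N} r_j r_j^* (B_n - zI)^{-1}. \]

Using (21), for each $j$

\[ r_j^* (B_n - zI)^{-1} = r_j^* (B_{(j)} - zI + r_j r_j^*)^{-1} = \frac{1}{1 + r_j^* (B_{(j)} - zI)^{-1} r_j} \, r_j^* (B_{(j)} - zI)^{-1}, \tag{22} \]

so that

\[ I + z (B_n - zI)^{-1} = \sum_{j=1}^{N} \frac{1}{1 + r_j^* (B_{(j)} - zI)^{-1} r_j} \, r_j r_j^* (B_{(j)} - zI)^{-1}. \]

We take the trace on each side and divide by $N$:

\[ \frac{1}{a} + z \frac{1}{a} m_n(z) = \frac{1}{N} \sum_{j=1}^{N} \frac{r_j^* (B_{(j)} - zI)^{-1} r_j}{1 + r_j^* (B_{(j)} - zI)^{-1} r_j} = 1 - \frac{1}{N} \sum_{j=1}^{N} \frac{1}{1 + r_j^* (B_{(j)} - zI)^{-1} r_j}, \]

so that

\[ -\frac{1}{z} \left( 1 - \frac{1}{a} \right) + \frac{1}{a} m_n(z) = -\frac{1}{zN} \sum_{j=1}^{N} \frac{1}{1 + r_j^* (B_{(j)} - zI)^{-1} r_j}. \]

Using (19) we have

\[ \underline{m}_n(z) = -\frac{1}{zN} \sum_{j=1}^{N} \frac{1}{1 + r_j^* (B_{(j)} - zI)^{-1} r_j}. \tag{23} \]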

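With the resolvent identity and identities (22) and (23),

\[
\begin{aligned}
(-z \underline{m}_n(z) T - zI)^{-1} - (B_n - zI)^{-1}
&= (-z \underline{m}_n(z) T - zI)^{-1} \left[ \sum_{j=1}^{N} r_j r_j^* + z \underline{m}_n(z) T \right] (B_n - zI)^{-1} \\
&= \sum_{j=1}^{N} \frac{1}{1 + r_j^* (B_{(j)} - zI)^{-1} r_j} \left[ (-z \underline{m}_n(z) T - zI)^{-1} r_j r_j^* (B_{(j)} - zI)^{-1} - \frac{1}{N} (-z \underline{m}_n(z) T - zI)^{-1} T (B_n - zI)^{-1} \right] \\
&= \frac{1}{z} \sum_{j=1}^{N} \frac{1}{1 + r_j^* (B_{(j)} - zI)^{-1} r_j} \left[ (-\underline{m}_n(z) T - I)^{-1} r_j r_j^* (B_{(j)} - zI)^{-1} - \frac{1}{N} (-\underline{m}_n(z) T - I)^{-1} T (B_n - zI)^{-1} \right].
\end{aligned}
\tag{24}
\]

Taking the trace of (24) and dividing by $n$ we have

\[ \frac{1}{n} \operatorname{tr} (-z \underline{m}_n(z) T - zI)^{-1} - m_n(z) = \frac{1}{zn} \sum_{j=1}^{N} \frac{d_j(z)}{1 + r_j^* (B_{(j)} - zI)^{-1} r_j}, \tag{25} \]

where

\[ d_j(z) = r_j^* (B_{(j)} - zI)^{-1} (-\underline{m}_n(z) T - I)^{-1} r_j - \frac{1}{N} \operatorname{tr} (B_n - zI)^{-1} (-\underline{m}_n(z) T - I)^{-1} T. \]

We break this into four terms

\[
\begin{aligned}
|d_j(z)| \le{} & \left| r_j^* (B_{(j)} - zI)^{-1} (\underline{m}_n(z) T + I)^{-1} r_j - r_j^* (B_{(j)} - zI)^{-1} (\underline{m}_{(j)}(z) T + I)^{-1} r_j \right| && (26) \\
& + \left| r_j^* (B_{(j)} - zI)^{-1} (\underline{m}_{(j)}(z) T + I)^{-1} r_j - \frac{1}{N} \operatorname{tr} (B_{(j)} - zI)^{-1} (\underline{m}_{(j)}(z) T + I)^{-1} T \right| && (27) \\
& + \left| \frac{1}{N} \operatorname{tr} (B_{(j)} - zI)^{-1} (\underline{m}_{(j)}(z) T + I)^{-1} T - \frac{1}{N} \operatorname{tr} (B_{(j)} - zI)^{-1} (\underline{m}_n(z) T + I)^{-1} T \right| && (28) \\
& + \left| \frac{1}{N} \operatorname{tr} (B_{(j)} - zI)^{-1} (\underline{m}_n(z) T + I)^{-1} T - \frac{1}{N} \operatorname{tr} (B_n - zI)^{-1} (\underline{m}_n(z) T + I)^{-1} T \right|. && (29)
\end{aligned}
\]

We will bound $|d_j(z)|$ in Section 3.4 using the lemmas developed in Sections 3.2 and 3.3.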



3.2 Large deviation estimates for exceptional events 

In this section we will define certain typical events, denoted by $\mathcal{X}$, $\mathcal{T}_1$, $\mathcal{T}_2$ and $\mathcal{B}$, that will be needed
to estimate $|d_j(z)|$. Recall that $x_j$ denotes the $j$th column of $X$ and let $X_{(j)}$ denote the matrix
obtained by removing the $j$th column of $X$. Define the event



x rl<^k <2 and (\-^-\ 
12- n ~ V Vb 



n 



X (j) X (j) 



< Ab for all 1 < j < (30) 
For some small $c > 0$, let $\mathcal{T}_1$ denote the event

\[ \mathcal{T}_1 := \left\{ T = \left( \tfrac{1}{n} YY^* \right)^{-1} \text{ is well-defined and } (1 - c) a_- \le T \le (1 + c) a_+ \right\}. \tag{31} \]

Let $\{\tilde{t}_1, \ldots, \tilde{t}_n\}$ denote the eigenvalues of $T$. Let $\{\tilde{\tau}_1, \ldots, \tilde{\tau}_n\}$ denote their classical locations given
through the limiting density $f_{Inv}$ from (16), i.e. they are defined through the formula

\[ \int_{-\infty}^{\tilde{\tau}_k} f_{Inv}(x) \, dx = \frac{k}{n}, \qquad k = 1, \ldots, n. \tag{32} \]
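Classical locations of this kind are simply quantiles of the limiting density, and they can be approximated by inverting a numerically integrated distribution function. The sketch below does this for the inverse Marchenko-Pastur density $f_{Inv}$; the value $a = 2$, the choice $n = 10$, the grid size and the function names are illustrative assumptions.

```python
import numpy as np

def inverse_mp_density(x, a):
    """Density of 1/(a t) when t follows the Marchenko-Pastur law with ratio 1/a."""
    lo = (1.0 - a ** -0.5) ** 2
    hi = (1.0 + a ** -0.5) ** 2
    x = np.atleast_1d(np.asarray(x, dtype=float))
    u = 1.0 / (a * x)
    out = np.zeros_like(x)
    inside = (u > lo) & (u < hi)
    out[inside] = a / (2.0 * np.pi * x[inside]) * np.sqrt((u[inside] - lo) * (hi - u[inside]))
    return out

def classical_locations(a, n, grid_size=200_001):
    """Points tau_k with (approximately) mass k/n of the density to their left."""
    a_minus = 1.0 / (a * (1.0 + a ** -0.5) ** 2)
    a_plus = 1.0 / (a * (1.0 - a ** -0.5) ** 2)
    grid = np.linspace(a_minus, a_plus, grid_size)
    dens = inverse_mp_density(grid, a)
    steps = 0.5 * (dens[1:] + dens[:-1]) * np.diff(grid)   # trapezoidal increments
    cdf = np.concatenate([[0.0], np.cumsum(steps)])
    mass = float(cdf[-1])                                   # should be close to 1
    cdf = cdf / mass                                        # remove quadrature error
    targets = np.arange(1, n + 1) / n
    return np.interp(targets, cdf, grid), mass, (a_minus, a_plus)

tau, mass, (a_minus, a_plus) = classical_locations(a=2.0, n=10)
print(tau, mass)
```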



Let

\[ \mathcal{T}_2 := \left\{ \frac{1}{n} \sum_{k=1}^{n} |\tilde{t}_k - \tilde{\tau}_k| \le \frac{1}{n} (\log n)^{C_T \log\log n} \right\} \tag{33} \]

denote the event that the actual eigenvalues are close to their classical locations, where $C_T > 0$ is a
constant independent of $n$.
(i) (i) 

Let fXi,..., denote the eigenvalues of By) for j = 1, . . . , N. For a fixed constant K , let 
B be the event that 

1 nl 1 1 n_ 1 1 iT 
B := < sup max — > — < i^(logn) 2 > fl < sup max — > — ^ < — L 

I , e£(M) i<i<iv n ^ |^0) _ z \ - J I ze£(M) i<i<iv n ^ _ z \2 v i 

(34) 

where 

fOO : ={ z = E + ir,€C :E (\+ - ^) (— |^ - A_) > «, < r? < l}. 

Proposition 3.4. With the notations above, the following estimate holds:

\[ \mathbb{P}(\mathcal{X} \cap \mathcal{T}_1 \cap \mathcal{T}_2 \cap \mathcal{B}) \ge 1 - n^{-c \log\log n}. \tag{35} \]

We first prove several lemmas.
Lemma 3.5. Let $\{\tau_1, \ldots, \tau_n\}$ be the classical locations w.r.t. the Marchenko-Pastur law, i.e.

\[ \int_{-\infty}^{\tau_k} f_{MP}(x) \, dx = \frac{k}{n}, \qquad k = 1, 2, \ldots, n. \tag{36} \]

Assume that the points $\{t_1, \ldots, t_n\}$ satisfy

\[ |t_k - \tau_k| \le \delta, \qquad \min(t_1, \ldots, t_n) \ge \alpha > 0 \tag{37} \]

for some $\delta, \alpha > 0$. Set $\tilde{t}_k := (a t_{n+1-k})^{-1}$. Then, recalling the definition of $\tilde{\tau}_k$ from (32), we have

\[ |\tilde{\tau}_k - \tilde{t}_k| \le \frac{\delta}{a \alpha (1 - a^{-1/2})^2}, \qquad k = 1, 2, \ldots, n. \]

Proof. The smallest point contained in the support of $f_{MP}$ is $(1 - a^{-1/2})^2$, so given (36) we have

\[ \int_{(a \tau_k)^{-1}}^{\infty} \frac{1}{a x^2} f_{MP}\left( \frac{1}{ax} \right) dx = \frac{k}{n}. \]

Considering the relation between $f_{Inv}$ and $f_{MP}$ (see (16)), it follows that $\tilde{\tau}_k = (a \tau_{n+1-k})^{-1}$. By
the assumption (37) we have

\[ \left| \frac{1}{a \tau_k} - \frac{1}{a t_k} \right| \le \frac{\delta}{a \alpha (1 - a^{-1/2})^2} \]

for all $k = 1, 2, \ldots, n$. □

The following lemma is a variation of Lemma 4.7 of (H. 

Lemma 3.6. Let the entries ofb^C n be uniformly ^-subexponential independent random vari- 
ables satisfying Kbi = and IE | fc»« | 2 = 1 for 1 < % < n. Let S be an n x n positive definite 
matrix satisfying s min < S < s m3iX for < s m i n < 1 and 1 < s max < 00. Let be a 

set of orthonormal vectors in C n and set £j = \b* Svi\ 2 for i = 1, . . . , n. If e > and m satisfy 
|(1 — e)s^ lin y / m > s^ ax (log n) c /or a constant C depending only on 7, ?/zen 



log log n 



i=l 



Proof. We set = S'f , for 1 < i < m. As in the proof of Lemma 4.7 in 0, we set 

n m 

Z = ^2 a k,i[ b kk - E6 fc 6j] for a M = ^ Wi(k)wi(l). 

k,l=l i=l 

We note that ^ 6 = Z + IHI 2 - Als °;E^=i M 2 = tr[E^i(^«i)«5)] a < 4**™, 
where in the last step we used that M = Y^iLi(^ v i)( v i^*) ls a matr i x of rank at most m and 
M < S 2 < s^ ax . Our assumptions on e and m give |(1 — e)s^ in m > (log ^) c 's^ iax v / m. Therefore, 
using Lemma B.2 of jH, 



m m 

P($><e£in™) < P(l^l >Elkif-«Ln^) 
i=l i=l 

< p(|Z|>(l-e)4 in m" 



< P 



+P 



k^l 

n 



k=l 



< p 



^a k: i[b k k - Eb k b t 

k^l 




> (log nfs^^/m 
> (log n) c s 2 mSLX ^i 



> (logn)°' I J2\ c 
^akAbkh - Eb k bk



fc=i 

< (7 n _loglogn 



> {\ogn) c [J2 



,fc=i 




Lemma 3.7. Assume E G £jt\ set I v = [E—U, andletM n denote the number of eigenvalues 



2 

of (YY*y l / 2 XX*{YY*)- l l 2 in I v . If { ^^- < 7] < E/2 then there exist constants c,K > 0, 
depending only on 7 such that 



for all large n. 



Proof. First, we use that $(YY^*)^{-1/2} XX^* (YY^*)^{-1/2}$ and $X^* (YY^*)^{-1} X = \frac{1}{n} X^* T X$ have the
same nonzero eigenvalues to justify working with the latter matrix. Now, following the proof of
Lemma 8.1 in [7], we need to bound the absolute value of the diagonal entries of $(\frac{1}{n} X^* T X - zI)^{-1}$.
We consider the $(1,1)$ entry and let $x_1$ denote the first column of $X$ and $X_{(1)}$ the matrix obtained
by removing this column from $X$. We use the following identity for an arbitrary matrix $A$, which
can be seen by using a singular value decomposition,

\[ A (A^* A - zI)^{-1} A^* = I + z (AA^* - zI)^{-1}. \]
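The identity above holds for rectangular matrices of any shape. A minimal numerical check, assuming an arbitrary $5 \times 8$ complex test matrix and an arbitrary spectral parameter:

```python
import numpy as np

rng = np.random.default_rng(5)
n, N = 5, 8
A = rng.standard_normal((n, N)) + 1j * rng.standard_normal((n, N))
A_star = A.conj().T
z = 0.7 + 0.3j

# A (A^* A - zI)^{-1} A^*  versus  I + z (A A^* - zI)^{-1}
lhs = A @ np.linalg.inv(A_star @ A - z * np.eye(N)) @ A_star
rhs = np.eye(n) + z * np.linalg.inv(A @ A_star - z * np.eye(n))
print(np.max(np.abs(lhs - rhs)))
```

In the singular value decomposition $A = U \Sigma V^*$ both sides are diagonal in the basis $U$, with entries $\sigma^2/(\sigma^2 - z) = 1 + z/(\sigma^2 - z)$.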

We will use this identity for the matrix $A = \frac{1}{\sqrt{n}} T^{1/2} X_{(1)}$. By the matrix inversion formula,



1 



-X*TX - zl 



n 



-1 



:m) 



\x\Tx, - z - ±xlTX (1) (±X*TX (1) - ziy^X*Tx x 



l x * Txi _ z _ I x *ri(7 + z(^X (1) X* (1) T l 2 - ziy i )T l 2 Xl 

1 



_^ _ IzxlT^^X^X*^ - ziyiTW 

Let fii, . . . , /i n _i and ui,...,w n _i denote the eigenvalues and normalized eigenvectors of 
iTiXftXfcT*. Then setting £ fc = \u\T^xi\ 2 for 1 < k < n - 1, we have 



-X*T~ X X - zl 
n 



< 



< 



< 



1 



n 

n 



Cnr\ 



E Y.k:\p, k -E\<^ 
Continuing to follow [7], we have



By Theorem 1.2 in [15], inequality (31) occurs with probability at least $1 - C n^{-\log\log n}$, i.e. $\frac{1}{2} a_- \le
T \le 2 a_+$. By setting $\epsilon = \frac{1}{2}$ in Lemma 3.6 and choosing $K$ large enough, we satisfy




> \ 2 J^(\ogn)e > 2al(\o g nf. 




'E 

We now apply Lemma 3.6 to obtain the claim. □

Lemma 3.8. Let fir\ k = 1, . . . , n — 1 denote the eigenvalues of Brj\ for an arbitrary j = 
1,2, ... N, and assume that r\ > n^ilogn) and that E satisfies (E — A_)(A + — E) > k. Then, 
when 7i holds, there exists a positive constant K such that with probability at least 1 — n~ cloglogn 



sup max 



ze£M i<j<N yn^ \p%> -z 

and 



sup max 



1 1 K 



Proof. For a fixed z G and index j, given the bound on P [j\f v > from Lemma [3771 

the claim follows from the same calculation as is done in the proof of Proposition 4.3 in [Q . This 
proves the estimate for each fized z. To obtain the result simultaneously for all z £ we notice 
that the derivatives of the functions to be bounded in (|39| ) and (|4Q|) are bounded by Cnr)~ 4 on 
£M. Thus, we may discritize £^ to Cn 2 r] 8 points and take the union bound with respect to the 
discrete grid and the indices 1 < j < N. □ 

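Proof of Proposition 3.4. We will prove the following four inequalities.

\[ \mathbb{P}(\mathcal{X}) \ge 1 - n^{-c \log\log n}, \tag{41} \]
\[ \mathbb{P}(\mathcal{T}_1) \ge 1 - n^{-c \log\log n}, \tag{42} \]
\[ \mathbb{P}(\mathcal{T}_2 \mid \mathcal{T}_1) \ge 1 - n^{-c \log\log n}, \tag{43} \]
\[ \mathbb{P}(\mathcal{B} \mid \mathcal{T}_1) \ge 1 - n^{-c \log\log n}. \tag{44} \]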

Let x denote an arbitrary column of X. We assume that n is large enough so that (log n) c < 
and apply Lemma B.2 of [8] to the identity matrix. Then (41) follows by summing the $N = bn$
probabilities. Inequality (42) follows from the much stronger statement in Theorem 1.2 of [15].

Recall the definition of Tj, . . . ,r n from (|36"1) and let . . . ,t n denote the eigenvalues of 
±XX*. Then Theorem 1.3 of O gives 

an " " ° 

\r k - t k \ < (lognf Tloglosn n- 2/3 {mm{n + 1 - k, k))~ 1/3 for all k = 1, 2, . . . n, 
with probability at least $1 - C \exp\left( -(\log n)^{c \log\log n} \right)$. Lemma 3.5 then implies that

\[ |\tilde{\tau}_k - \tilde{t}_k| \le (\log n)^{C_T \log\log n} n^{-2/3} (\min(n + 1 - k, k))^{-1/3} \quad \text{for all } k \]

with the same probability. Thus, assuming $\mathcal{T}_1$ holds to address $t_1$ and $t_n$, we have shown (43).
Lastly, Lemma 3.8 yields (44). Together these give the bound

\[ \mathbb{P}(\mathcal{X} \cap \mathcal{T}_1 \cap \mathcal{T}_2 \cap \mathcal{B}) \ge 1 - n^{-c \log\log n}. \]



3.3 Integral Equation 

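In this section we show that the integral equation (18) that $m_p(z)$ satisfies is stable. This means
that if $m_\delta$ satisfies

\[ m_\delta(z) = \int \frac{1}{\lambda \left( 1 - \frac{1}{a} - \frac{1}{a} z m_\delta(z) \right) - z} \, f_{Inv}(\lambda) \, d\lambda + \delta(z), \tag{45} \]

for some small $\delta(z)$, then $m_\delta$ is close to $m_p$.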



Lemma 3.9. Assume mg(z) G C + is analytic on the upper half plane and is a solution to the 
perturbed equation (|45l) . Let < rf < 1, k > and assume T\ holds. Let E be chosen so that 
(X+ — -g^i) {eTT ~ — K ' There exists a small universal constant c\ > such that if 

\rr^(E + irj) - m p (E + ir])\< ciV« + rj (46) 

for all r\ G [r]', 1] and 

sup \6(E + ir))\ < <Ji (47) 

7?6[t?',1] 



/or some 5i < ci a/k, 

V K + v 



m (E + irj) - m p {E + irj)\ < C 2 t \ _ (48) 



and 

\m s (E + irj) - m p (E + irj)\<C 2 . 1 , (49) 

/or a// ?7 G [77', 1], where C 2 depends only on c\. 
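We define

\[ \underline{m}_p(z) = -\frac{1}{z} \left( 1 - \frac{1}{a} \right) + \frac{1}{a} m_p(z) \quad \text{and} \quad \underline{m}_\delta(z) = -\frac{1}{z} \left( 1 - \frac{1}{a} \right) + \frac{1}{a} m_\delta(z). \tag{50} \]

Notice that for $\delta = 0$, $m_\delta$ and $\underline{m}_\delta$ are the same as $m_p$ and $\underline{m}_p$. Simple algebra shows that from (45)
we have the following equation for $\underline{m}_\delta$:

\[ \underline{m}_\delta(z) \left( z - \frac{1}{a} \int \frac{\lambda}{1 + \lambda \underline{m}_\delta(z)} f_{Inv}(\lambda) \, d\lambda \right) = -1 + \frac{z \delta(z)}{a}. \tag{51} \]

We will work mostly with $\underline{m}_\delta$ instead of $m_\delta$. We introduce the notation

\[ K_0 = \frac{1}{a} \int A^2(\lambda) f_{Inv}(\lambda) \, d\lambda \quad \text{and} \quad A(\lambda) = \frac{\lambda \underline{m}_p(z)}{1 + \lambda \underline{m}_p(z)}. \]

Note that both $K_0$ and $A$ depend on $z$, which we omit writing here but will include in the proof
of Lemma 3.9. The following lemma bounds $K_0$ away from 1, which will be used in the proof of
Lemma 3.9. We set

\[ \mathcal{G} := \left\{ z = E + i\eta \in \mathbb{C} : E \in (\mu_-, \mu_+), \ \left( \lambda_+ - \frac{1}{1+E} \right) \left( \frac{1}{1+E} - \lambda_- \right) \ge \kappa, \ 0 < \eta \le 1 \right\}. \]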

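Lemma 3.10. There exist two positive constants $C$ and $c$ such that the following statements hold
uniformly for all $z = E + i\eta \in \mathcal{G}$.

1.
\[ c \le |\underline{m}_p(z)| \le C \quad \text{and} \quad c \le |m_p(z)| \le C, \tag{52} \]
and $m_p(z)$, $\underline{m}_p(z)$ extend continuously to the interval $[\mu_-, \mu_+] \subset \mathbb{R}$.

2.
\[ \Im \underline{m}_p(z) \ge \frac{1}{a} \Im m_p(z) \ge c \sqrt{\kappa + \eta}. \tag{53} \]

3. Let $\underline{m}(z)$ be a function $\mathbb{C}^+ \to \mathbb{C}^+$. Suppose for some $G \subset \mathcal{G}$

\[ \sup_{z \in G} |\underline{m}_p(z) - \underline{m}(z)| \le c. \tag{54} \]

When $\mathcal{T}_1$ holds with a sufficiently small $c$ in its definition (31),

\[ \sup_{z \in G} \| (\underline{m}(z) T + I)^{-1} \| \le C. \tag{55} \]

4.
\[ |1 - K_0(z)| \ge c \sqrt{\kappa + \eta}. \tag{56} \]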
Proof. 1.) From (51) with $\delta = 0$ we have the following equation for $\underline{m}_p(z)$:

\[ z = -\frac{1}{\underline{m}_p(z)} + \frac{1}{a} \int \frac{\lambda}{1 + \lambda \underline{m}_p(z)} f_{Inv}(\lambda) \, d\lambda. \tag{57} \]

If $\{z_k\}_{k=1}^{\infty}$ is a sequence in $\{E + i\eta : E \in [\mu_-, \mu_+], \ 0 < \eta \le 1\}$ such that either $\lim_{k \to \infty} |\underline{m}_p(z_k)| = 0$
or $\lim_{k \to \infty} |\underline{m}_p(z_k)| = \infty$, then since $0 < \mu_- \le |z_k| \le \mu_+ + 1$ for all $k$, (57) would be violated.
Thus, no such sequence exists, which proves (52). The continuous extension follows from $\underline{m}_p$
being analytic on $\mathbb{C}^+$. The statements for $m_p$ follow from (50) and the fact that $E = \Re z$ is
separated away from zero.

2.) We define the interval J := [a_ + c(re + 77), a + — c(re + 77)]; by setting the constant c 
sufficiently small and using 77 < 1 we can ensure that |J| > ~(a + — a_). Since / p has a square 
root singularity near the edge of its support, we have f p (X) > c^Jk + 77 for all A G J. Thus 
> Ca/k + T] / — rrr; ~dX

J J [p ~ 



> cy/^+v, (58) 



where the last inequality follows from the fact that the length of J and the distance from E to J are 
both 0(1) and 77 < 1. We then also have 3m p (z) > ^sm p (z) > c\sJ~k + 77 maybe with a smaller 
constant c 1 . 

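3.) We take the imaginary part of (57) to obtain

\[ \frac{\Im \underline{m}_p}{|\underline{m}_p|^2} - \frac{1}{a} \int \frac{\lambda^2 \, \Im \underline{m}_p}{|1 + \lambda \underline{m}_p|^2} f_{Inv}(\lambda) \, d\lambda = \eta, \tag{59} \]

so that

\[ \frac{\eta \, |\underline{m}_p|^2}{\Im \underline{m}_p} = 1 - \frac{1}{a} \int |A(\lambda)|^2 f_{Inv}(\lambda) \, d\lambda \tag{60} \]

and

\[ \frac{1}{a} \int |A(\lambda)|^2 f_{Inv}(\lambda) \, d\lambda < 1. \tag{61} \]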

From (I6TI ), the uniform bound on \m v (z) | and the lower bound on the support of we have 



/m v (A)<iA < a (62) 



|1 + Xm p (z)\ 2 

with some constant C uniformly for all z G Q. Suppose that 

w(z) := inf 11 + Am (z)\ 

AG [a-,a + ] 

is attained at A = X (z) for any z G Q. Note that w(z) > a_3m p (z) > Cy/n + 77 with some 
positive constant c, where we used (1531) . Since the derivative of A !->■ |1 + Xm Jz)\ is uniformly 
bounded by (l52l) . there exists a subinterval J = J(^) C [a_, a+] of length at least 010(2:) such that 
|1 + Am(z)| < 2w(z) for all A G J. Since /inv(A) has a square-root singularity at its edges and is 
bounded away from zero between the edges, we have 



C * / iTT~\ 7 ^, 2 /mv(A)rfA > — ^ / /mv(A)cZA > -==. (63) 
7 |1 + Am p (z)| 2 Aw(z) 2 Jj y/w(z) 

Therefore we have a uniform lower bound 

|1 + Xm p (z)\ > c', (64) 

for d depending only on the constant C in (|62l) . for all A G (a_, a + ) and for all z G Using 
continuity in both A and m p (z) and using that (l54l) holds with a sufficiently small c, we have 
|1 + tm(z)\ > c for all t G [(1 — c)a_, (1 + c)a+] and 2; G G if c is chosen sufficiently small. By 
applying the spectral theorem for T and (13TI) with a small c, we obtain (155T) . 

4.) The variable 2 plays no role in the remainder of the proof and so we omit it from the 
notation. By the assumption on E, we have 3m p > C\Jk + 77. The property |m p (,2)| < C and 
A < a+ give 



SM(A) 3m 



|A(A)| |1 + Am p ||m pl 



> c^sm p (z) > cy/n + 77. (65) 
We set

\[ B := \frac{1}{a} \int |A(\lambda)|^2 f_{Inv}(\lambda) \, d\lambda, \]

so that by (61) we have $|K_0| \le B < 1$. We claim that

\[ |B - K_0| \ge c B \sqrt{\kappa + \eta} \tag{66} \]

for a positive constant $c$.



By the lower bound on $|m_P(z)|$ in (52) and on $|1+\lambda m_P(z)|$ in (64), there exists a constant $C$
such that

$$ \frac{\Im A(\lambda)}{|A(\lambda)|} = \frac{\Im m_P(z)}{|1+\lambda m_P(z)|\,|m_P(z)|} \le C\,\Im m_P(z) \tag{67} $$

for all $z \in Q$ and $\lambda \in (a_-,a_+)$. Let $\varepsilon > 0$ be a sufficiently small constant so that $1 - 2C\varepsilon \ge \tfrac12$ with
the constant $C$ from (67). If $\Im m_P(z) \ge \varepsilon$, then, using (65),



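$$
\begin{aligned}
|B - K_0| &= \frac{1}{a}\left|\int \big(|A(\lambda)|^2 - A^2(\lambda)\big) f_{\mathrm{inv}}(\lambda)\,d\lambda\right| \\
&= \frac{1}{a}\left|\int 2(\Im A(\lambda))^2 f_{\mathrm{inv}}(\lambda)\,d\lambda - 2i\int (\Re A(\lambda))(\Im A(\lambda)) f_{\mathrm{inv}}(\lambda)\,d\lambda\right| \\
&\ge \frac{2}{a}\int (\Im A(\lambda))^2 f_{\mathrm{inv}}(\lambda)\,d\lambda \\
&\ge c\varepsilon\,\Im m_P(z)\int |A(\lambda)|^2 f_{\mathrm{inv}}(\lambda)\,d\lambda \\
&\ge c\varepsilon B\sqrt{\kappa+\eta}.
\end{aligned}
$$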

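If $\Im m_P(z) < \varepsilon$, then we set $A(\lambda) = e^{i\phi(\lambda)}|A(\lambda)|$. We note that $\phi \in (0,\pi)$ since $\Im A(\lambda) > 0$,
and that $\phi$ is well-defined since $|A| \neq 0$. By (67) we have $\sin\phi \le C\varepsilon$, and by continuity, either
$0 < \phi \le C\varepsilon$ or $\pi - C\varepsilon \le \phi < \pi$. In both cases we have $|\cos\phi| \ge \tfrac12$ if $\varepsilon$ is small. So we have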



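$$
\begin{aligned}
|B - K_0| &= \frac{1}{a}\left|\int \big(|A(\lambda)|^2 - A^2(\lambda)\big) f_{\mathrm{inv}}(\lambda)\,d\lambda\right| \\
&= \frac{2}{a}\left|\int |A(\lambda)|^2 e^{i\phi}\sin\phi\, f_{\mathrm{inv}}(\lambda)\,d\lambda\right| \\
&\ge \frac{2}{a}\left|\int |A(\lambda)|^2 \cos\phi\,\sin\phi\, f_{\mathrm{inv}}(\lambda)\,d\lambda\right| - \frac{2}{a}\int |A(\lambda)|^2 \sin^2\phi\, f_{\mathrm{inv}}(\lambda)\,d\lambda \\
&\ge \frac{1}{a}\int |A(\lambda)|^2 (\sin\phi - 2\sin^2\phi)\, f_{\mathrm{inv}}(\lambda)\,d\lambda \\
&\ge \frac{1}{2a}\int |A(\lambda)|^2 \sin\phi\, f_{\mathrm{inv}}(\lambda)\,d\lambda \\
&\ge c\,\Im m_P(z)\int |A(\lambda)|^2 f_{\mathrm{inv}}(\lambda)\,d\lambda,
\end{aligned}
$$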



where for the last inequality we use $\sin\phi = \Im A/|A|$ and (67). Since $\varepsilon$ depends only on $C$
from (67), we use (53) to obtain

$$ |B - K_0| \ge cB\sqrt{\kappa+\eta}. $$

We trivially have $|1 - K_0| \ge |B - K_0|$ for any positive number $B$ with $B < 1$ and complex number
$K_0$ with $|K_0| \le B$. Therefore (56) follows from (66) and from a bound $B \ge c > 0$. The bound
$B \ge c > 0$ follows from the fact that $|A(\lambda)| \ge c > 0$ on the support of $f_{\mathrm{inv}}$ and that $f_{\mathrm{inv}}$ is a
probability measure whose support is separated from zero. □



Proof of Lemma 3.9. Throughout the proof, $\delta$ is a function of $z$, but we will omit writing this
dependence. We also fix $E = \Re z$ and we vary only $\eta = \Im z$. Note that from definition (50) we
have

rrip(z) - m s (z) = a(m p (z) - rn^( z )), (68) 

so that (gU) follows from (|48~1) . 
To prove (48) we observe that



z ( 1 

m P (z) - nh( z ) = (m.p(z) - m( z )) K ( z ) --(*-- 

^ ^ a \ a 



X 



1 + Amr(z) 



where 



K{z) :-- 



a J 



l+\rn s {z) l+\m p (z) 



fi m {X)dX 



{z-U TTA^y/mvlA)^) (z-U TT ^y/lnv(A)c/A 



/m v (A)dA 5, (69) 



(70) 



is obtained by taking the difference of the expressions for m p (z) and rn s (z) given in (I5TI) and ((57 
Since |2Z^(^)| < C and, for 5 small enough, < |, using (15TI) we have 



A 



1 + Amrfz 



■fi nv (X)dX 



and therefore ( 1691 ) yields 



\m p {z) -msK z ) 



From (I5TI) and (1701) we have 

_ , = i r H( 2 ) W*) , 

1 ] a J 1 + Xm p {z) l + A^z) 71 



(A)dA 



(71) 



z5 — a J 1 + Am (z) 1 + Xm s (z) 



/lnv(A)dA. 

(72) 

Since $|z\delta| \le \tfrac12$ and $|\delta| \le \delta_1$, the absolute value of the second term in (72) is bounded by $C\delta_1$. Here
we used (64) and that a similar positive lower bound holds for $|1+\lambda m_\delta(z)|$ as well, assuming that
$c_1$ in (46) is sufficiently small. Thus,



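$$
\begin{aligned}
|K(z) - K_0(z)| &\le \frac{1}{a}\int \left|\frac{\lambda m_P(z)}{1+\lambda m_P(z)}\right| \left|\frac{\lambda m_\delta(z)}{1+\lambda m_\delta(z)} - \frac{\lambda m_P(z)}{1+\lambda m_P(z)}\right| f_{\mathrm{inv}}(\lambda)\,d\lambda + C\delta_1 \\
&\le \frac{1}{a}\int \left|\frac{\lambda m_P(z)}{1+\lambda m_P(z)}\right| \left|\frac{\lambda\,(m_\delta(z) - m_P(z))}{(1+\lambda m_P(z))(1+\lambda m_\delta(z))}\right| f_{\mathrm{inv}}(\lambda)\,d\lambda + C\delta_1 \\
&\le C|m_\delta(z) - m_P(z)| + C\delta_1.
\end{aligned}
$$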
By (46) and $\delta_1 \le c_1\sqrt{\kappa+\eta}$ we now have

$$ |K(z) - K_0(z)| \le Cc_1\sqrt{\kappa+\eta} \tag{73} $$

for all $\eta \in [\eta', 1]$.

Choosing $c_1$ so small that $Cc_1 \le \tfrac12 c$, where $c$ is the constant obtained in the estimate (56) in
Lemma 3.10, we have



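$$ \frac{1}{|1 - K(z)|} \le \frac{C}{\sqrt{\kappa+\eta}} \tag{74} $$

for all $\eta \in [\eta', 1]$ and with a sufficiently large constant $C$. Using (71) we have

$$ |m_P(z) - m_\delta(z)| \le \frac{C|\delta|}{\sqrt{\kappa+\eta}}. \tag{75} $$

□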

Lemma 3.11. Let $\mu$ be a probability measure supported on some interval $[u, v] \subset \mathbb{R}$ and let

$\{\tau_k\}_{k=1}^n$ be real numbers such that

k 



for k = 1, ... ,n and assume m £ C + . 77zen 



rriTk 



1 n 1 

-F — 

n ^ 1 + ? 

^/"{^fc}fc=i w another set of points in R, f/zen 

lA 1 1 " 1 

n 1 + mti. 71 ' 1 + ? 



1 + mr 



n 



< 





v — u 




m 




inf te[u,i>] 


1 + mt 


2 



k=l k=l 

Proof. For an arbitrary differentiable $f$ we have



1 


m 




71 inf te [ Uil) ] 


1 + mt 


2 



fe=i J 



/(r)d/x(i 



< 



Ife — a| 



sup |/'(t)|. 



Here we have 



sup 

te[u,v] 



d 1 



< 





m 






1 


+ £m 


2 



(it 1 + mt 

This proves the first claim. The second claim is proven similarly. 



(76) 

a 
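The first claim of Lemma 3.11 can be checked numerically in a simple special case. The sketch below is only an illustration under assumptions that are ours and not taken from the text: the measure is uniform on $[u, v]$ with $0 < u < v$, the points $\tau_k = u + k(v-u)/n$ are read as its $k/n$-quantiles, and the bound is taken to be $\frac{v-u}{n}\,|m| / \inf_{t\in[u,v]}|1+mt|^2$. The function name `lemma_3_11_first_bound` is hypothetical.

```python
import numpy as np


def lemma_3_11_first_bound(m, u, v, n):
    """Return (error, bound) for the uniform probability measure on [u, v].

    Assumptions (for illustration only): 0 < u < v, Im m > 0, and
    tau_k = u + k*(v - u)/n are the k/n-quantiles of the uniform measure.
    """
    if not (0 < u < v and m.imag > 0):
        raise ValueError("need 0 < u < v and Im m > 0")
    k = np.arange(1, n + 1)
    tau = u + k * (v - u) / n
    # Discrete average (1/n) * sum_k 1/(1 + m*tau_k).
    discrete = np.mean(1.0 / (1.0 + m * tau))
    # Closed form of (1/(v-u)) * int_u^v dt/(1 + m*t). The principal branch of
    # the logarithm is valid because 1 + m*t stays in the upper half-plane.
    integral = (np.log(1.0 + m * v) - np.log(1.0 + m * u)) / (m * (v - u))
    error = abs(discrete - integral)
    # |1 + m*t|^2 is a convex quadratic in t, so its minimum over [u, v] is
    # attained at the clipped vertex t* = -Re(m)/|m|^2.
    t_star = np.clip(-m.real / abs(m) ** 2, u, v)
    inf_abs = abs(1.0 + m * t_star)
    bound = (v - u) / n * abs(m) / inf_abs ** 2
    return error, bound
```

Calling `lemma_3_11_first_bound(0.5 + 0.3j, 1.0, 3.0, 50)` returns the discretization error together with the bound, and the error should not exceed the bound.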
3.4 A bound on $|d_j|$
For any $z \in Q$ we set

$$ M(z) := \max\Big( \|(m_n(z)T + I)^{-1}\|,\ \max_j \|(m_{(j)}(z)T + I)^{-1}\| \Big). \tag{77} $$

Lemma 3.12. Suppose that for some $z \in Q$ we have $M(z) \le C_0$ with some constant $C_0$. Then
there exists a constant $C$ such that


(\ dj (z)\ > ^0^\B,TuT 2 ,x) < Cn-*"°*« 
\ */nrj / 



for $j = 1, \dots, N$ whenever $n\eta \ge M^2$.

We start with a short lemma.
Lemma 3.13. For $m_n$ and $m_{(j)}$ as defined in (20) and $z = E + i\eta$,



$$ |m_n(z) - m_{(j)}(z)| \le \frac{\pi}{\eta n} $$

for all $n \in \mathbb{N}$ and all $E \in \mathbb{R}$.



Proof. Using $F_n$ and $F_{(j)}$ to denote the corresponding distribution functions, by Theorem A.44
of [ffll, 

$$ \sup_t |F_n(t) - F_{(j)}(t)| \le \frac{1}{n}. $$

Then

$$ |m_n(z) - m_{(j)}(z)| \le \frac{1}{n}\int \frac{dx}{|x - z|^2} = \frac{\pi}{\eta n}. $$
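The bound of Lemma 3.13 is easy to test numerically. The following is a minimal sketch under assumptions of ours: $B_n = \sum_j r_j r_j^*$ with complex Gaussian columns $r_j$, $B_{(j)} = B_n - r_j r_j^*$, and both Stieltjes transforms normalized by $1/n$. The exact definition (20) is not reproduced here, so this normalization and the helper name `rank_one_stieltjes_gap` are hypothetical.

```python
import numpy as np


def rank_one_stieltjes_gap(n, N, z, seed=0):
    """Return (|m_n(z) - m_(j)(z)|, pi/(eta*n)) for a random test matrix.

    Assumed model (illustration only): B_n = R R^* with an n x N complex
    Gaussian matrix R scaled by 1/sqrt(N), and B_(j) removes one column.
    """
    rng = np.random.default_rng(seed)
    R = (rng.standard_normal((n, N)) + 1j * rng.standard_normal((n, N)))
    R = R / np.sqrt(2 * N)
    B_n = R @ R.conj().T
    r_j = R[:, 0]
    # Removing the rank-one term r_j r_j^* gives the minor-type matrix B_(j).
    B_j = B_n - np.outer(r_j, r_j.conj())
    # Stieltjes transforms (1/n) * tr (B - z)^{-1}, computed from eigenvalues.
    m_n = np.mean(1.0 / (np.linalg.eigvalsh(B_n) - z))
    m_j = np.mean(1.0 / (np.linalg.eigvalsh(B_j) - z))
    return abs(m_n - m_j), np.pi / (z.imag * n)
```

For example, `rank_one_stieltjes_gap(40, 80, 1.0 + 0.05j)` returns the observed gap and the bound $\pi/(\eta n)$; the gap should stay below the bound for every $z$ in the upper half-plane.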



Proof of Lemma 3.12. We need to bound the terms (26)-(29) and will use Lemmas 3.8 and 3.10
repeatedly. For (26) we use Lemma 3.13 and obtain

1 CM 2 
-\mx{B Q) -ziy\m n (z)T + I)-\m n {z)-m^ < log 2 n. 

Similarly, 

i||(B (j) - ziy\m n {z)T + iy\m n {z) - m {j) (z))T \m n (z)T + I)" 1 ^ 

^ CM 2 I, r| _ 2 ,i 

< (tr By) -zJ 2 > 

nt] n 

CM 2 

~ (nn) 3 / 2 ' 
where the last inequality follows from (40). Using Lemma B.2 of [8],

p(\m\> M{logn)C ) <n"^». 



For (27) we need to bound the absolute value of

N 



b 



[ThB {j) - zirim^T + I^tI 



(b k k-Eb k k), (78) 

k,i 



where $b_k$ denotes the $k$th entry of the vector $x_j$. On the set B, the second inequality in (34) gives
the following bound on the Hilbert-Schmidt norm

illT^^-^-^m^^T + j)- 1 ^!!^ < ^iKm^^T + j)- 1 !!!!^)-^)- 1 !!^ 

n 

CM 
< . 



mn 

Thus, by Lemma B.2 of [8],

.(logn' l<: 



Pfirai > M l ° gnJ ) < CVr loglogri . (79) 



For (28), using Lemma 3.13, (39) and the resolvent identity, we have

\m\ < ^IICm^Wr + ^-Cm^CzjT + ^iitrKByj-z/)- 1 ! 

< tr S^-zJ - 1 

n nn 

CM 2 log 2 n 

< — • 

nn 

For (29), using the resolvent identity we see that

(80) 

has rank one, so that a bound on the norm of (80) gives a bound on its trace. We then return to the
expression on the left of (80) and use $\|(B_{(j)} - zI)^{-1}\|, \|(B_n - zI)^{-1}\| \le \eta^{-1}$ to obtain

|d29}| < - .J)- 1 - (£ n - ^-^KWr + T^ll < — . 

n n?7 
Combining these four bounds just obtained gives the statement of the lemma. □ 
3.5 Proof of Theorem 3.2

We isolate three parts of the proof of Theorem 3.2 in the following lemmas.

Lemma 3.14. Let z = E + irj with E e and r/ > n~ 1 K~ 2 (\ogn) 4CTlogloen , and assume 

that 7i and X hold. Then with probability at least 1 — n~ cloglog? \ 



1 



mm 



. |1 + r*(B U) - zl)- x rj\ > ; rTT . (81) 

The proof of Lemma 3.14 is given following the proof of Theorem 3.2.

Lemma 3.15. Assume E G (//_,//+) and (A + — g-j) (grj — A_) > k and ?/to 7i and /zo/d. 
Then for any sufficiently small constant C\ > f/zere exists c > depending only on C\ such that 
with probability at least 1 - ra - cl °g lo s™ 

f] a; tl\ 1Ct log logn 

|m p (£ + i) - m n (E + i) | < v s ; _ . (82) 

m 



7n particular, for any sufficiently small constant C\ > 

|m p (E + z) - m n (E + i) \ < ciVk + 1. (83) 

Moreover, we also have 

min |1 + r*{B (j) - (E + > c x (84) 

l<j<N J 

holds for all n large enough. 

Proof. Let {fj>k}kZi an d {vkj^Zi denote the eigenvalues and corresponding normalized eigenvec- 
tors of Buy In the following we use that the eigenvalues of Bu) and B n interlace (see page 82 of 
0). Given that we are on the set 71 H X (see (J30]) and (O), we have 



n—l i j, in n— 1 



*». |2 



fe=l 

c || rj || 2 = c||Ta;|| 2 > c||a;|| 2 > c. (85) 



A similar calculation together with (50) implies $\Im m_n(E+i), \Im m_{(j)}(E+i) \ge c$, hence, recalling
the definition (77),

$$ M = M(E+i) = \max\Big( \|(m_n(E+i)T + I)^{-1}\|,\ \max_j \|(m_{(j)}(E+i)T + I)^{-1}\| \Big) \le C. \tag{86} $$

Recalling the constant $C_T$ from (33), we set

floe;n)t c ' Tloglogn 
8 V ■= 1 ^ • (87) 
Using $M$ as just estimated, for $n$ large enough the extra factors of $\log n$ in $s_1$ will exceed the
constant necessary to satisfy the condition of Lemma 3.12. Thus, by Lemma 3.12 and (33),

$$ |d_j(E+i)| \le s_1, \qquad j = 1, \dots, N, $$

with probability at least $1 - Cn^{-c\log\log n}$. Returning to (25) and using the lower bound on the
denominators given by (85),



JV 



(£ + i)iV 1 + rj(%) - (E + i)J)-i ri 

Using equation (19) and recalling that $\{t_1, \dots, t_n\}$ are the eigenvalues of $T$,

1 



< Csi. 



(88) 



m n (£ + i) 



(E + i)m n (E + i)) -(E + i) 

flnv(\)d\ 



(89) 



< 



m n (E + i) 



E + i J 1 + Xm n (E + ( 



E + in 



T I 

^ 1 + t k m n 



+ 



1 1 



n 

y — - 

f-? 1 + t k m„ 



k m n (E + i 
1 



E + in k Ti 1 + f ^n( E + E + i J 1 + Xm n (E + i 
For (|90b we use ([25]) and (EH) to obtain 

1 1 JL 1 

< Csi. 



^h™{X)dX 



(90) 
(91) 



^ . 1 1 <A 1 

m n [E + i) > 



For (91) we use (33), both parts of Lemma 3.11 and the bounds (52), (86) to obtain



1 1 



iv - 

n z — ' 1 + turn* 



1 



C (logn) Crloglogn 
< h v ; 



■fi nv (X)dX 



< 



n n 
(\ogn) CTlog logn 



n 



Since $s_1 \ge n^{-1}(\log n)^{C_T\log\log n}$ and $s_1 \le c_1\sqrt{\kappa}$ if $n$ is sufficiently large, we have

© < Csi, 

so that by inequality (48) of Lemma 3.9 we have

|m p (£ + i) - m n (E + i)\< Cs 1 < ^—2- ' 



n 



(92) 



(93) 
This yields (82) and for $n$ large enough we also get (83). From (93), (52) and (81) we have the
bound

min \l + r]{B U) -{E + i)I)- 1 r j \ > — l — 

l<j<7V' 3K U> Jl 4:Cfl + 

with probability at least $1 - n^{-c\log\log n}$. Choosing $c_1$ small enough, this yields (84). □
Lemma 3.16. Assume E G and (A+ — ^tj) (^+T ~~ ^-) — K an< ^ choose an rj' G 



-2 



[rt 4 k; 2 (logn) 4C,Tloglogn , 1], where Ct is the constant appearing in (|33~T> . 77" := 77' — 77 
Assume that 71 an J A" /zo/J an J ?/za? vwY/z a sufficiently small constant c± 

\m p (E + irj) - m n (E + irj)\ < c^K + 77 (94) 

holds for all $\eta \in [\eta', 1]$ with some probability at least $1 - P(n)$. Then with a probability at least
$1 - P(n) - n^{-c\log\log n}$, we have

(\oe:n) 2CT lo s lo s n 

\m p (E + ir]) -m n (E + irj) | < v 6 \ (95) 

for all $\eta \in [\eta'', 1]$. In particular, provided $\eta'' \ge n^{-1}\kappa^{-2}(\log n)^{4C_T\log\log n}$, the bound (94) holds for
all $\eta \in [\eta'', 1]$, with a probability at least $1 - P(n) - n^{-c\log\log n}$.

Proof. By a trivial continuity argument, first we prove that

$$ |m_P(E+i\eta) - m_n(E+i\eta)| \le 2c_1\sqrt{\kappa+\eta} \tag{96} $$

holds for all $\eta \in [\eta'', 1]$ with probability at least $1 - P(n)$. Indeed, the functions $m_P(E+i\eta)$
and $m_n(E+i\eta)$ are Lipschitz continuous for fixed $E$ and $\eta \in [\eta'', 1]$ with derivatives bounded
by $(1/\eta'')^2$. In particular, the derivative of $m_n$ with respect to $\eta$ is uniformly bounded on
$[n^{-1}\kappa^{-2}(\log n)^{4C_T\log\log n}, 1]$ by $Cn^2\kappa^4(\log n)^{-8C_T\log\log n}$. By this continuity and the proximity
of $m_n(E+i\eta')$ and $m_P(E+i\eta')$ given by (94), the inequality (96) holds for all $\eta \in [\eta'', 1]$.

Now we show that the stronger estimate (94) can be regained from the weaker estimate (96) for
all $\eta \in [\eta'', 1]$. Assuming that $c_1$ is sufficiently small, Lemma 3.13, the first inequality of (96) and
inequality (55) of Lemma 3.10 along with the spectral theorem give the bound $M(E+i\eta) \le C$ for
all $\eta \in [\eta'', 1]$. Recalling the definition of $s_\eta$ from (87), we use the bound on $M$ and Lemma 3.12
for large enough $n$ to obtain

$$ \max_{1\le j\le N} |d_j(E+i\eta)| \le s_\eta $$

with probability at least $1 - P(n) - n^{-c\log\log n}$ for all $\eta \in [\eta'', 1]$.

Using (52) and (96), we get that $|m_n(E+i\eta)| \ge \tfrac12 c$ for all $\eta \in [\eta'', 1]$ if $c_1$ is sufficiently small,
where $c$ is the constant from Lemma 3.10. Then from Lemma 3.14 we see that, with probability
at least $1 - P(n) - n^{-c\log\log n}$,

$$ \min_{1\le j\le N} |1 + r_j^*(B_{(j)} - (E+i\eta)I)^{-1} r_j| \ge c' \tag{97} $$

with some positive constant $c'$. Using (97) to bound the denominators and an argument just like
the one used for the point $E+i$ in Lemma 3.15,



m " (£ + ! " ) " ' A( 1 -i-i(j + „K(« + „) H£ + „) fc(A ) iA 
$\le Cs_\eta$ (98)



for all $\eta \in [\eta'', 1]$. By Lemma 3.9 we obtain

CS„ n ogn )2C T loglogn 

\m p (E + it]) -m n (E + irj) \ < - I < v 6 \ (99) 

for all $\eta \in [\eta'', 1]$ with probability at least $1 - P(n) - n^{-c\log\log n}$. Since

$$ \eta \ge \eta'' \ge n^{-1}\kappa^{-2}(\log n)^{4C_T\log\log n}, $$

we have

$$ s_\eta \le (\kappa+\eta)(\log n)^{-2C_T\log\log n}, $$

which yields (94) with a very high probability for all $\eta \in [\eta'', 1]$ and for sufficiently large $n$. □

Proof of Theorem 3.2. The proof of i) follows directly from the previous two lemmas. Set $\eta_p =
1 - pn^{-2}$, $p = 0, 1, 2, \dots$, then (13) is proved in Lemma 3.15 for $\eta = 1$ with probability at least
$1 - n^{-c\log\log n}$. Lemma 3.16 then implies that

Qoe ti) 2Ct lo s lo § n 

\mJE + ir>) -m n (E + ir>)\ < v (100) 

holds for all $\eta \in [\eta_1, \eta_0 = 1]$ with the possible exception of a set of probability $2n^{-c\log\log n}$.
Iterating Lemma 3.16 we get that (100) holds for all $\eta \in [\eta_p, 1]$ with probability at least
$1 - (p+1)n^{-c\log\log n}$ as long as $\eta_p \ge n^{-1}\kappa^{-2}(\log n)^{4C_T\log\log n}$. This proves the first part of Theorem 3.2
for a fixed energy $E$. To take care of all energies simultaneously, we use that the derivative of $m_n$
is uniformly bounded by $C\eta^{-2}$. Thus we can discretize the energy range to $Cn^2$ points and take
the union bound to obtain i) of Theorem 3.2.
Now we prove ii). Set



$$ \rho_\eta(E) = \frac{1}{\pi}\Im m_n(E+i\eta) = \frac{1}{\pi n}\sum_{k=1}^n \frac{\eta}{(\mu_k - E)^2 + \eta^2}. $$

Given Lemma 3.7, the argument for Corollary 2.2 in [14] gives

P ( SUp Pv (E) > K ) < 1 - C7277^ logl ° grl 

where $K$ is the constant in Lemma 3.7. Using this inequality, ii) follows from the argument given
to prove the analogous claim in Corollary 4.2 of JH. □ 

Proof of Lemma 3.14. From (23) we have

mJz) = , ^— —-(1-A-B) (101) 

where 

A 1 ys MXgQ) ~ Zl)- 1 ~ hTT(B n - Zl)- 1 

Njr{ l + r*(B U) -zI)-i rj 




and 



N 



J'=l 



(B b -) - zlY'r, - ^txT(B U) - zl)' 
l + T*AB U) -zI)-i rj 



We first give a bound on $\mathcal{A}$. Using (22) and the resolvent identity we have

1 



trT(£ n - ziy 1 - trT(B U) - zl) 



-i 



l + r?(%)- 



Using Lemma 3.8,



E-r*^.)-^)- 1 ^)-^)- 1 ^ 



n 



<-^tr|fi (i) -z/|- 2 <— , 



(102) 



which for large n can be made smaller than any constant. Similarly, 



2 )mI 2 < ^tr|B w -*I|- 



< 



n 2 V ?? 2 

c 

(nrj) 3 / 2 



which is also smaller than any constant for large $n$. Using the resolvent identity, Lemma B.2 of [8]
and a union bound, for any constant $\varepsilon > 0$,



max 



(103) 



< 

< n -cloglogn 



-tiT(B n - zl)' 1 - -txT(B m - zl)' 1 > e J 
n n y ' ) 

\*T(B n - ziy l r jr *{B {j) - ziy 1 - EirJ(S W - ziy 1 T(B^ - ziy 1 ^ > 



max 

\i<j<JV 



Now we look at £?. By Lemma [3781 ||T(5(j) — zl) 1 \\hs < C 'y^- hi this case, again for any 
constant e > and for n large enough, 

v^ mBw -zin\ HS <c<^<, 



nrj 



Therefore, by Lemma B.2 of [8],

P 



Lmax \r*(B w - z/) -1 ^ - ^trT(B {j) - ziy 1 ] > ej < n - clogl ° sn . 



(104) 



If $|1 + r_j^*(B_{(j)} - zI)^{-1}r_j| \ge c$, then by setting $\varepsilon = c^2$ in (103) and (104), with probability at least
$1 - n^{-c\log\log n}$, we have $|z|^{-1}(1 - |\mathcal{A}| - |\mathcal{B}|) \ge (2\mu_+)^{-1}$ so that, by (101),

$$ \left|\frac{1}{1 + \frac{1}{n}\mathrm{tr}\,T(B_n - zI)^{-1}}\right| \le 2\mu_+|m_n(z)|. $$

By (103) and (104), with probability at least $1 - n^{-c\log\log n}$,

$$ \max_{1\le j\le N}\left|\frac{1}{1 + r_j^*(B_{(j)} - zI)^{-1}r_j}\right| \le 4\mu_+|m_n(z)|. $$



Proof of Theorem 3.3. Recall that $X_{(1)}$ denotes the $n \times (N-1)$ matrix obtained by removing the
first column of $X$. Consider the component $v_1$ of $v$. Following the proof of Theorem 1.2 in [13],



1 + xJT-3X (1) (/i - Xl 1) T- 1 X {l) )- 2 X* m T- 1 *x 1 



'(!)- 



(105) 



Let T 2 X(x) have the singular value decomposition 



n-l 



T-^X {l) = Y,\J^Sk<- 



k=l 



subintervals of length rj. If /i is in the interval I a = [a — |, a + |], then 



When 7i and ^ occur, min(/i^ i ' 1 , . . . , /^i-i) ^ 1°-(1 — 7&) 2- * n ®' we now P ar tition into 



(TT05T) 



< 



< 



E 



n-l (1)| * I 



\k:X k £l a 



-1 



^E 



|U fc Xl| 



so that, setting £ fc = | v^x^Mil 



\k:\ k ela ^ 



k:\kdla 



2 a.(l-6-') a )V 



(106) 



The eigenvalues of T^X^^T-a and rkPT^ interlace (see page 82 of [2]), so that the 
number of eigenvalues of each matrix in an interval differs at most by two. We use Lemma [3~!6l 
and an argument similar to the proof of Lemma [3771 to obtain the optimal t, t = 772, and a bound 
for (11061) . We partition 8^ into at most n subintervals of the form I a and take the union bound for 
the terms of the form (11061) corresponding to the subintervals and the bn indices. □ 
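The interlacing property used in this proof (and earlier for $B_{(j)}$ and $B_n$) can be illustrated with a generic rank-one update. The sketch below is not the specific pair of matrices from the proof, which is not reproduced here; it assumes a random Hermitian matrix `A` and a random vector `r`, and the helper names are hypothetical.

```python
import numpy as np


def interlacing_check(n, seed=0):
    """Return ascending eigenvalues of A and of A + r r^* for random A, r."""
    rng = np.random.default_rng(seed)
    G = rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))
    A = (G + G.conj().T) / 2  # Hermitian test matrix
    r = rng.standard_normal(n) + 1j * rng.standard_normal(n)
    low = np.linalg.eigvalsh(A)
    high = np.linalg.eigvalsh(A + np.outer(r, r.conj()))
    return low, high


def count_in(eigs, left, right):
    """Number of eigenvalues in the closed interval [left, right]."""
    return int(np.sum((eigs >= left) & (eigs <= right)))
```

With `low, high = interlacing_check(30)`, interlacing means `low[k] <= high[k] <= low[k+1]` for every admissible `k`, so the counts `count_in(low, a, b)` and `count_in(high, a, b)` differ by a bounded amount for any interval `[a, b]`.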



4 General MANOVA Matrices

Now we turn to the proof of Theorem 2.2, which follows easily from Theorem 3.2.






Proof of Theorem 2.2. We recall the observations that

$$ (XX^* + YY^*)^{-1/2}\, YY^*\, (XX^* + YY^*)^{-1/2} \tag{107} $$

and

$$ (YY^*)^{1/2} (XX^* + YY^*)^{-1} (YY^*)^{1/2} \tag{108} $$

have the same eigenvalues and

$$ (YY^*)^{1/2} (XX^* + YY^*)^{-1} (YY^*)^{1/2} = \big(I + (YY^*)^{-1/2} XX^* (YY^*)^{-1/2}\big)^{-1}. $$

Note that by factoring $YY^*$ as two factors of $(YY^*)^{1/2}$ rather than $Y$ and $Y^*$, we avoid changing
the number of zero eigenvalues. If $(YY^*)^{-1/2} XX^* (YY^*)^{-1/2}$ has eigenvalues $\mu_1, \mu_2, \dots, \mu_n$,
then (108), and hence (107), has eigenvalues $\lambda_k = (1 + \mu_k)^{-1}$, $k = 1, 2, \dots, n$, i.e. the correspondence
$\lambda = (1 + \mu)^{-1}$ maps one set of eigenvalues into the other one. Since the eigenvalues
of $(YY^*)^{-1/2} XX^* (YY^*)^{-1/2}$ are positive, this correspondence is regular. The correspondence also
maps the interval [/*_,//+] to [A_,A+] and the regions £^ to and £;t' to The same 

transformation gives the correspondence between f p and f M : 



An easy calculation shows the relation between the Stieltjes transforms. Setting $z' = z^{-1} - 1$, after
a change of variables, we have

$$ m_P(z') = \int \frac{f_P(\mu)\,d\mu}{\mu - z'} = -\int \frac{z\lambda}{\lambda - z}\, f_M(\lambda)\,d\lambda = -z - z^2 m_M(z). $$

Similarly

$$ m_{n,P}(z') = \frac{1}{n}\sum_{k=1}^n \frac{1}{\mu_k - z'} = -\frac{1}{n}\sum_{k=1}^n \frac{z\lambda_k}{\lambda_k - z} = -z - z^2 m_{n,M}(z), $$

so

$$ m_{n,M}(z) - m_M(z) = -z^{-2}\big(m_{n,P}(z') - m_P(z')\big). \tag{109} $$

(Strictly speaking, we defined Stieltjes transforms m(z) for z G C + , but the formula © clearly 
defines it for all z 6 C \ R and we have m(z) = m(z).) Since the sets and £n are separated 
away from zero, we always have a positive lower bound on $|z|$ and $|z'|$. Notice that $\Im z' = -|z|^{-2}\Im z$,
therefore $|\Im z'|$ and $|\Im z|$ are comparable. Thus part i) of Theorem 2.2 follows from (109) and part i)
of Theorem 3.2. One can similarly conclude part ii) of Theorem 2.2 from part ii) of Theorem 3.2.
This proves Theorem 2.2. □
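The two algebraic facts behind this proof, the eigenvalue map $\lambda_k = (1+\mu_k)^{-1}$ and the identity $m_{n,P}(z') = -z - z^2 m_{n,M}(z)$ with $z' = z^{-1} - 1$, can be checked on a small random instance. The sketch below assumes complex Gaussian $X$ and $Y$ of sizes $n \times [bn]$ and $n \times [an]$; the helper names `inv_sqrt_psd` and `manova_check` are ours and purely illustrative.

```python
import numpy as np


def inv_sqrt_psd(S):
    """Inverse square root of a Hermitian positive definite matrix."""
    w, V = np.linalg.eigh(S)
    # V diag(w^{-1/2}) V^*: dividing by sqrt(w) rescales the columns of V.
    return (V / np.sqrt(w)) @ V.conj().T


def manova_check(n, a, b, z, seed=0):
    """Return (lam, mu, m_nP(z'), -z - z^2 m_nM(z)) for random X, Y."""
    rng = np.random.default_rng(seed)

    def ginibre(rows, cols):
        g = rng.standard_normal((rows, cols)) + 1j * rng.standard_normal((rows, cols))
        return g / np.sqrt(2)  # complex entries with variance one

    X = ginibre(n, int(b * n))
    Y = ginibre(n, int(a * n))
    XX = X @ X.conj().T
    YY = Y @ Y.conj().T
    S_inv_half = inv_sqrt_psd(XX + YY)
    W_inv_half = inv_sqrt_psd(YY)
    # Eigenvalues of the MANOVA matrix (107).
    lam = np.linalg.eigvalsh(S_inv_half @ YY @ S_inv_half)
    # Eigenvalues of (YY^*)^{-1/2} XX^* (YY^*)^{-1/2}.
    mu = np.linalg.eigvalsh(W_inv_half @ XX @ W_inv_half)
    z_prime = 1.0 / z - 1.0
    m_nP = np.mean(1.0 / (mu - z_prime))
    m_nM = np.mean(1.0 / (lam - z))
    return lam, mu, m_nP, -z - z ** 2 * m_nM
```

For instance, `manova_check(6, 2.0, 3.0, 0.4 + 0.1j)` returns both spectra and the two sides of the Stieltjes transform identity, which should agree up to rounding error.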

A Computing $m_M$

We refer to [10] for the calculation of $m_M(z)$. There the Stieltjes transform of



ft \ a xr \ , V( x ~ X -)( X +~ X ) r /, b-^xt 1> 
is obtained, which we will denote $m_f(z)$ here. The result is



* - 4s + aA 2 - ( 2 4t; - rzsW> + (^) 2 

a+o \/ \ a+b (a+b)^ > \ a+o ' 



m '« = 2,(1 - ,) ' 10) 

This is seen by setting p = and q = ^ in the notation of iTTOl (note, though, that the point 
mass at zero in f(x) has been removed in Ifl5l0 . We then have that 

i \ i i\ f f \ a 1 6—11 
m M {z) = [a + b) I mf{z) + 



a + 6z a + 6 1 — z 
Inserting (II 101) into this formula, we obtain £7]). 